Mining association rules for HIV-1 protease cleavage site prediction
Abstract
Several machine learning techniques, such as neural networks, nonlinear support vector machines and decision trees, have been used to model the specificity of HIV-1 protease and to extract specific patterns from peptides cleaved by this protease. Despite many studies, no perfect rules are yet known for determining whether a peptide is cleaved by HIV-1 protease. Such rules are useful for designing specific and efficient HIV inhibitors. Our results show that mining association rules can find several specificity rules for HIV-1 protease, each with a cleavage probability of 100%. By comparison, recent papers on this subject report best rules with cleavage probabilities ranging from 16% to 91%.
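The "cleavage probability" of a rule can be read as the confidence of an association rule: among the peptides that match the rule's antecedent, the fraction that are cleaved. The sketch below illustrates that calculation only. It is not the authors' method: the abstract does not say which mining algorithm was used, the function name `rule_confidence` and the `min_support` parameter are hypothetical, only single-item rules of the form "residue X at position i" are considered, and the octapeptides are made-up placeholders rather than real cleavage data.

```python
from collections import defaultdict


def rule_confidence(peptides, min_support=2):
    """Return the confidence of each rule "(position, residue) => cleaved".

    peptides: list of (sequence, cleaved) pairs, where cleaved is a bool.
    min_support: minimum number of peptides that must match the antecedent
                 (hypothetical threshold, chosen for illustration).
    """
    # (position, residue) -> [matching peptides, matching peptides that are cleaved]
    counts = defaultdict(lambda: [0, 0])
    for seq, cleaved in peptides:
        for pos, residue in enumerate(seq):
            counts[(pos, residue)][0] += 1
            if cleaved:
                counts[(pos, residue)][1] += 1
    # confidence = cleaved matches / all matches, for sufficiently frequent items
    return {
        item: n_cleaved / n_total
        for item, (n_total, n_cleaved) in counts.items()
        if n_total >= min_support
    }


# Made-up octapeptides, for illustration only (not real cleavage data).
toy = [
    ("AAAAFAAA", True),
    ("AAAAFAAG", True),
    ("GAAAGAAA", False),
    ("GAAAGAAG", False),
]

rules = rule_confidence(toy)
perfect = sorted(item for item, conf in rules.items() if conf == 1.0)
print(perfect)
```

A full association-rule miner would also consider combinations of positions (for example with an Apriori-style search) and would report support alongside confidence, since a rule with 100% confidence that matches very few peptides may not generalize.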
Related articles
Mining HIV protease cleavage data using genetic programming with a sum-product function
MOTIVATION: In order to design effective HIV inhibitors, studying and understanding the mechanism of HIV protease cleavage specification is critical. Various methods have been developed to explore the specificity of HIV protease cleavage activity. However, success in both extracting discriminant rules and maintaining high prediction accuracy is still challenging. The earlier study had employed g...
A new approach for HIV-1 protease cleavage site prediction combined with feature selection
Acquired immunodeficiency syndrome (AIDS) is a fatal disease that seriously threatens human health. Human immunodeficiency virus (HIV) is the pathogen that causes this disease. Investigating HIV-1 protease cleavage sites can help researchers find or develop protease inhibitors that restrain the replication of HIV-1 and thus help combat AIDS. Feature selection is a new approach for solving t...
Feature Selection Combined with Neural Network Structure Optimization for HIV-1 Protease Cleavage Site Prediction
Understanding the specificity of HIV-1 protease is crucial for designing HIV-1 protease inhibitors. In this paper, a new feature selection method combined with neural network structure optimization is proposed to analyze the specificity of HIV-1 protease and to find the important positions in an octapeptide that determine its cleavability. Two kinds of newly proposed features based on Amino Ac...
Mining Biological Data Using Self-Organizing Map
This paper presents a novel method of mining biological data using a self-organizing map (SOM). After partitioning a set of protein sequences using SOM, conventional homology alignment is applied to each cluster to determine the conserved local motif (biological pattern) for the cluster. These local motifs are then regarded as rules for prediction and classification. In the application to the p...
Why neural networks should not be used for HIV-1 protease cleavage site prediction
Several papers have been published where nonlinear machine learning algorithms, e.g. artificial neural networks, support vector machines and decision trees, have been used to model the specificity of the HIV-1 protease and extract specificity rules. We show that the dataset used in these studies is linearly separable and that it is a misuse of nonlinear classifiers to apply them to t...